# Hierarchical Token Compression
InternVL 2.5 HiCo R64
Apache-2.0
A video multimodal large language model (MLLM) enhanced with Long and Rich Context (LRC) modeling, which improves on existing MLLMs by sharpening their perception of fine-grained details and capturing long-term temporal structures.
Video-to-Text
Transformers, English

OpenGVLab
252
2
InternVL 2.5 HiCo R16
Apache-2.0
InternVideo2.5 is a video multimodal large language model (MLLM) built upon InternVL2.5, enhanced with Long and Rich Context (LRC) modeling, capable of perceiving fine-grained details and capturing long-term temporal structures.
Video-to-Text
Transformers, English

OpenGVLab
1,914
3
Featured Recommended AI Models